# ImageNet Fine-tuning
Convnext Base 224 22k 1k
Apache-2.0
ConvNeXT is a pure convolutional model inspired by vision Transformer designs, pre-trained on ImageNet-22k and fine-tuned on ImageNet-1k, outperforming traditional Transformers.
Image Classification
Transformers

C
facebook
1,879
4
Vit Base Patch16 224
Apache-2.0
Vision Transformer model pre-trained on ImageNet-21k and fine-tuned on ImageNet for image classification tasks
Image Classification
V
google
4.8M
775
Beit Large Patch16 384
Apache-2.0
BEiT is a vision Transformer-based image classification model, pretrained in a self-supervised manner on ImageNet-21k and fine-tuned on ImageNet-1k.
Image Classification
B
microsoft
44
0
Deit Base Patch16 384
Apache-2.0
DeiT is an efficiently trained Vision Transformer model, pre-trained and fine-tuned on the ImageNet-1k dataset at 384x384 resolution, suitable for image classification tasks.
Image Classification
Transformers

D
facebook
442
3
Vit Large Patch16 224
Apache-2.0
Large-scale image classification model based on Transformer architecture, pre-trained and fine-tuned on ImageNet-21k and ImageNet-1k datasets
Image Classification
V
google
188.47k
30
Featured Recommended AI Models